DSIR: the First TREC-7 Attempt

نویسنده

  • Arnon Rungsawang
چکیده

This paper describes our first large-scale retrieval attempt in TREC-7 using DSIR. DSIR is a vector space based retrieval system in which semantic similarity between words, documents and queries, is interpreted in terms of geometric proximity of vectors in a multi-dimensional space. A co-occurrence matrix computed directly from the collection is used to build the underlying semantic space. We have implemented DSIR on a cluster of lowcost PC Pentium-class machines, and chosen the PVM message-passing library to manage our distributed DSIR version. Although our first adhoc retrieval results are quite poor in terms of recall-precision measure, we believe that more work and experiments have to be explored in order to obtain more promising retrieval performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Co-Operative DSIR Text Indexing System

The unceasing development of the Internet technology currently revolutionizes the way we look for relevant information. Since the number of web pages is uncountable, and very disorganized, a powerful searching tool like Information Retrieval (IR) system is needed. In this paper, we propose a co-operative indexing system called “DSIR”. Co-operative DSIR is a full text vector space based indexing...

متن کامل

WaterlooClarke: TREC 2015 Contextual Suggestion Track

In this work we present a first attempt at developing a live system to solve the problem presented in the TREC 2015 contextual suggestion task. The goal of this task is to tailor point-of-interest suggestions to users according to their preferences [3]. We present how we gathered data for the candidate pointsof-interest, filtered some of the candidates and built a live system to return suggesti...

متن کامل

DSIR: Assessing the Design of Highly Potent siRNA by Testing a Set of Cancer-Relevant Target Genes

Chemically synthesized small interfering RNA (siRNA) is a widespread molecular tool used to knock down genes in mammalian cells. However, designing potent siRNA remains challenging. Among tools predicting siRNA efficacy, very few have been validated on endogenous targets in realistic experimental conditions. We previously described a tool to assist efficient siRNA design (DSIR, Designer of siRN...

متن کامل

JHU/APL at TREC 2001: Experiments in Filtering and in Arabic, Video, and Web Retrieval

The outsider might wonder whether, in its tenth year, the Text Retrieval Conference would be a moribund workshop encouraging little innovation and undertaking few new challenges, or whether fresh research problems would continue to be addressed. We feel strongly that it is the later that is true; our group at the Johns Hopkins University Applied Physics Laboratory (JHU/APL) participated in four...

متن کامل

Experiments in Query Processing at LEXIS-NEXIS for TREC-7

The purpose of this report is to provide an overview of LEXIS-NEXIS’ entries to the TREC-7 competition. The report will describe the experiments we conducted, the results we obtained, and our future research directions. The report is divided into three sections. The first section describes the experimental setup and gives a brief account of some of the research activities that led to the TREC-7...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998